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The meto-cleavage pathway for catechol is one of the major routes for the microbial degradation of aromatic 
compounds. Pseudomonas s^. strain CF600 grows efficiently on phenol, cresols, and 3,4-dimethyIphenol via a 
plasmid-encoded muluVomponent phenol hydroxylase and a subsequent mefe-cleavage pathway. The genes for 
the entire pathway were previously found to be clustered, and the nucleotide w -?uences of dmpKLMNOPBC and 

D, which encode the first four biochemical steps of the pathway, were determined. By using a combination of 
deletion mapping, nucleotide sequence determinations, and polypeptide analysis, we identified the remaining 
six genes of the pathway. The fifteen genes, encoded in the order dmpKLMNOPQBCDEFGHI, lie in a singe 
operon structure with intergenic spacing that varies between 0 to 70 nucleotides. Homologies found between the 
newly determined gene sequences and known genes are reported. Enzyme activity assays of deletion derivatives 
of the operon expressed in Escherichia coli were used to correlate dmpE, G, H , and / with known m«?£a-cleavage 
enzymes. Although the function of the dmpQ gene product remains unknown, dmpF was found to encode 
acetaldehyde dehydrogenase (acylating) activity (acetaldehyde:NAD + oridoi-eductase [coenzyme A acylatingl; 

E. C.1.2.1.10). The role of this previously unknown mtfa-cleavage pathway enzyme is discussed. 



The central role of catecholic intermediates in aerobic 
microbial degradation of aromatic compounds is well estab- 
lished. Catechol (1,2-dihydroxyhenzene) itself is an interme- 
diate in the degradation cf compounds such as benzoate, 
naphthalene, salicylate, and phenol, and substituted cate- 
chols are intermediates in the catabolism of methylated and 
chlorinated derivatives of these compounds (13, 34). A 
diverse array of enzymes can be elaboiated to convert 
aromatic compounds to central catecholic intermediates. 
However, the reactions used for oxygenative ring fission of 
the catechol and the subsequent conversion to Krebs cycle 
intermediates are limited to one of two metabolic alterna- 
tives: those of the ortho- and m*/a-cleavage pathways. The 
<?rf/i0-cleavage pathways involve ring cleavage between the 
two hydroxyl groups followed by a well-defined series of 
reactions leading to 0-ketoadipate (reviewed in reference 
13). The alternative me/a-cleavage pathway involves ring 
cleavage adjacent to the two catechol hydroxyls, followed 
by degradation of the ring cleavage product to pyruvate and 
a short-chain aldehyde (Fig. 1). The use of one pathway or 
the other is dependent upon the microbial species and/or the 
nature of the growth substrate. 

The mera-cleavage pathway was first studied in Pseudo- 
monas strains that can grow at the expense of phenol and 
cresofc (14, 29). Since then, the role of the mefci-cleavage 
pathway in aromatic biodegradation by bacteria of many 
genera, including species of Azotobacter and Alcaligenes 
and numerous species of Pseudomonas, has been demon- 
strated (2, 13, 23, 36). In addition, reactions of the lower part 
of the pathway are involved in the degradation of phenyl- 
propionates by Escherichia coli (8). From this variety of 
sources, a wealth of information has accumulated regarding 
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pathway chemistry, gene organization and regulation, and, 
to a lesser extent, gene structure and enzymoiogy. 

The most comprehensively studied me/a-cleavage path- 
way is that of the IncP-9 TOL plasmid pWWO, which 
encodes a toluene degradation pathway of Pseudomonas 
putida. "Tie me/a-cleavage pathway genes are located in an 
operon that encodes the enzymes for conversion of ben- 
zoate, via catechol, to central metabolites; a separate operon 
encodes the enzymes required to convert toluene and xy- 
lenes to the corresponding benzoates. The TOL meta- 
cleavage operon comprises 13 structural genes, of which 2, 
xylT and xylQ* have no known function (19). The remaining 
genes encode the enzymes required for the conversion of 
benzoate to catechol and then to pyruvate and acetaldehyde 
via the reactions shown in Fig. 1. The enzymes of the 
pathway from the ring fission enzyme downward have been 
studied to various extents. Whereas catechol 2,3-dioxygen- 
ase, for example, has been the subject of many studies, other 
enzymes, such as 4-hydroxy-2-oxovalerate aldolase phi' 
2-hydroxymuconic semialdehyde dehydrogenase, have ap- 
parently not even been purified. Many features of the TOL 
m?ro-cleavage pathway gene organization and enzyme func- 
tion are preserved in other aromatic catabolic pathways in 
Pseudomonas species (for a review, see reference 2). 

The m?ftz-cleavage pathway also functions in the degrada- 
tion of phenols and methyl-substituted phenols by a number 
of different Pseudomonas species (14, 26, 37, 40). Pseudo- 
monas sp. strain CF600 can grow efficiently with phenol, 
cresols, or 3,4-dimethylphenol (3,4-dmp) as the sole carbon 
and energy source (40). The catabolic pathway for these 
substrates is encoded on pVI150 an IncP-2 megaplasmid, 
and involves hydroxylation followed by a mefd-cleavage 
pathway. The genes for the enzymes of this pathway were 
previously found to be clustered, and the nucleotide se- 
quences of the genes involved in the first four biochemical 
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FIG 1 The PVU50 plasmid-encoded catabolic pathway for the 
diss milation ofphenol and its methylated ^7^* 
structural genes are denned in the text Compounds. L phenoU 1 
catechol* HI, 2-hydroxymucomc semialdehyde, IV 4-oxaiocroio 

tonate keto fonii (2-oxohex-3-ene-l,Mioate); VI. 2-oxopent-4- 
SwT4-hyd ro xy-2H>xovale ra te;VIU .acetaldchyde Enzymes: 
1 ohenol hydroxylase (PH); 2. catechol 2,3-dioxygenase (C230). j. 
hKomSK hydrolase <2HMSH):4. 2-hydroxy- 

mSKLldehyde dehydrogenase (2HMSD); 5, ^xalocroto- 
nate isomerase (401): 6, 4-oxalocrotonate decarboxylase (40D). 7. 
2?xopSe^ate hydratase (OEH); 8. ^yd^y^oxovalera e 
aldolase (HOA); 9, acetaldehyde dehydrogenase (acylaling) (ADA). 
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steps of the pathway. dmpK, L,M, .V. O. P,B,C. and D, 
have been determined (3, 4, 30, 31). 

The dmpKLMNOP ge"es are involved in the conversion 
of phenol to catechol. Oiay the polypeptide products of the 
dmpLMNOP genes, namely. PI, P2, P3. P4, and P5 ut 
required for in vitro activity of a multicomponent phenol 
hydroxylase (33). The function of PO, the product of dmpK, 
is unknown. Although PO is not required for in vitro pheno 
hydroxylase activity, it is lequired in vivo, together with PI 
throuah P5, for growth on phenol by a pseudomonad that can 
catabolize catechol (30). The dmpKLMNOP genes are ar- 
rayed in an operon structure with no more than 70 bp 
between any two genes. 

The dmpB, C, and D genes encode enzymes (Fig. 1) that 
have analogs in mew-cleavage pathways for catabohsm of a 
variety of aromatic compounds. The gene order of dmpB C 
and D is identical to that of the analogous genes of the TOL 
pathway, and the polypeptide product sizes are similar (3). 
The ring-cleavage enzyme catechol 2.3-d.oxygenase is en- 
coded by dmpB (4) amJ catalyzes the conversion of catechol 
to 2-hydroxymuconic semialdehyde. The coding region for 
catechol 2.3-dioxygenase is located approximately 350 bp 
downstream from dmpP. The amino acid sequence of Jus 
enzyme was found to be 83 to 87% homologous to catechol 
2 3-dLygenase from the TOL and NAH7 naphthalene 
catabolic pathways (4). The dmpC and impD genes encode 
the first enzymes of the branches of the m,/«-cleavage 
pathway: 2-hydrcxymuconic semialdehyde dehydrogenase 
S 2-hydroxymuconic semialdehyde hydrolase respec- 
tively. The dmpB, C. and D genes are closely linked with 
me genie spacing of 38 and 11 bp (31). Although nucleottde 
sequence information for isofunctional genes is not l avail- 
able, the product of dmpC. 2-hydroxymuconic semialdehyde 
dehydrogenase, exhibits approximately 40% sequence iden- 
tity with aldehyde dehydrogenases from eukaryotic sources 

<3 Here we report gene locations, nucleotide sequences 
Doiypeptide analysis, and associated activities for the res. ot 
ihe m*/<i-cleavage pathway genes of Pseudomonas p. strain 
CT 55 This work completes the gene structure informa ion 
for the m t >r«-cleavage enzymes and provides evidence that 
the phenol/3,4-dmp catabolic operon is composed ot 13 
uenes Moreover, the nucleotide sequence information in- 
cludes the discovery of a new °^ snc ^^ a A ^fl 
age pathway enzyme that can metabolize acetaldehyde. This 
work should considerably aid future efforts to increa e the 
comparatively meager knowledge of m^-cleavage pathway 
enzymology. 

MATERIALS AND METHODS 
Bacterial strains and plasmids. The bacterial grains and 
plasmids used in this study are listed in Table 1. Phumids 
Sere introduced into E. coli strains by the procedure o 
Kushner (27) and into Pseudomonas sp. strain PB2701 either 
by conjugation from E. coli S17-1 or by; el^roporation with 
a Bio-Rad Gene Pulser. Ampicillin (100 W5/ml) and carben- 
fcillm (1 to 2 mg/ml) were used for selection of plasmid- 
encoded ^lactamase in £. coli and Pseudomonas strains, 

reS pTa£dswere constructed by using standard recombinant 
techniques. The plasmids designed pVI are bas«ion the 
broad-host-range tac express.on vecto s PJJMB66HE and 
P MMB66EH or derivatives thereof (pMMBttHEi ana 
JmMB66EHA, respectively). ft* J £ 

pMMB66A vectors are indicated by a A symbol. The extent 
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TABLE 1. Bacterial strains and plasmids 



Strain or plasmid 



Relevant properties" 



Reference o» source 



E. coli 
S17-1 
DH5 
CSR603 

Pseudomonas strains 
Pseudomonas sp. strain CF600 
P. putida U 
P. putida PB2701 

Plasmids 
pBluescript SK(+) 
pMMB66HE and EH 
pMMB66HEA and EHA 
pVI264 
pVI265 
pVI298 
pVI299 
pV1300A 
pVI3^1A 
pVI3i)2A 
pVI303A 
pV1304A 
pVI305A 
pV1306A 
pVI307A 
P V1308A 
pV!309A 
pVI310A 
pVI311A 
pVI312A 
pVI313A 
pVI314A 
pVI3J5A 

pVI316A 



r" m + Tp r Sm r Mob + 
r" m + recAl 
Maxicell strain 



Phenol/3,4-dmp degrader 
Phenol degrader (ATCC 17514) 
r" m + Sm r derivative of KT2440 



Ap r cloning and sequencing vector 

Ap r , RSF1010 based tac promoter expression vectors carryins !aci q 

Derivatives of pMMB66HE and EH without the foci* repressor gene 

dmpKLMNOPQBCDE, Bglll-EcoRl fragment (kb 0.25-10.0) (Fig. 2) 

dmpKLMNOPQBCDEFGHl, BglXX-BamHX fragment (kb 0.25-19.9) (Fig. 2) 

dmpKLMNOPQBCDEFGHI* BgtlhNml fragment (kb 0.25-15.0) (Fig. 2) 

dmpKLMNOPQBCDEFGM, Bgl\\-Pvu\\ fragment tkb 0.25-13.5) (Fig. 2) 

dmpQ. Bat 31-Ps/I fragment (bp 5367-6240) (Fig. 4) 

dmpDE* Hpal-EcoRl torment (bp 8047-10051) (Fig. 4) 

dmpGHU EcoRX-BamHX fragment (bp 10051-19900) (Fig. 4> 

dmpDEFGH. HpaX-XhoX fragment {bp 8047-12784) (Fig. 4i 

dmpGH. EcoRX-XboX fragment (bp 10051-12784) (Fig. 4) 

dmpGHL EcoRX-BglXX fragment (bp 10051-16450) (Fig. 4) 

dmpFGHl, Nrul-NruX fragment (bp 9105-15000) (Fig. 4) 

dmpFGHl, NruX-PvuXX fragment (bp 9105-13489) (Fig. 4) 

drnpD* HpaX-SacX fragment (bp 8047-9070) (Fig. 4> 

dmpE, DdeX-EcoRX fragment (bp 9033-10051) (Fig. 4) 

dmpF, SalX-SatX fragment (bp 9790-10929) (Fig. 4) 

dmpF, NruX NotX fragment (bp 9105-11702) (Fig. 4) 

dmpG, SauX-BstElX fragment (bp 10785-11921) (Fig. 4) 

dmpH* NotX-XhoX fragment (bp 11702-12784) (Fig. 4) 

dmpl ScaX-PvuXX fragment (bp 12670-13489) (Fig. 4) 

dmpEH. DdeX-XhoX fragment (bp 9033-12784) (deleted between bp 10051- 

11702) (Fig. 4) 
dmpFG* NruX-Kpnl fragment <bp 9105-12553) (Fig. 4) 



41 
17 
38 



40 
14 

MBSC* 



Stratagene 

16 

40 

3 

3 

This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 
This study 

This study 



- r and m refer to host restriction and modification systems, respectively. Antibiotic reliance abbreviations: Ap'. ampicillin; Sm T . streptomycin; Ttf. 
trimethoprim. The catabolic dmp genes are described : n the text. Base pair coordinates are given relative to the last \m*s of the restriction enzyme recognition 

sites. 

b MBSC. M. Bagdasarian strain collection. 



of Pseidomonas sp. strain CF600-derived DNA present in 
these plasmids is listed in Table 1 and illustrated in Fig. 2 and 
4. 

Culture conditions. Cells for crude extract preparation 
were grown as follows. Pseudomonas sp. strain CF600 was 
grown at 30°C in minimal medium, supplemented with 2.5 
mM carbon source and trace metals (33), for approximately 
15 h after inoculation (1:200) of Luria broth-grown cells. 
Additional carbon source was added at 2- to 3-h intervals, 
and growth was continued fcr approximately 6 h. E. coli 
cells were grown at 37°C in Luria broth for 12 to 15 h after 
inoculation (1:1,000). Induction of the toc^-regulated tac 
promoter of pMMB66HE- and pMMB66EH-based plasmids 
was achieved by the addition of isopropyl-p-D-thiogalacto- 
pyranoside (IPTG), to a final concentration of 0.5 mM, at 3 
h postinoculation. 

Analysis of plasmid-encoded polypeptides. Plasmids were 
introduced into the maxicell CSR603 strain; plasmid-en- 
coded polypeptides were prepared, labeled with L-[ 35 S]me- 
thionine (Amersham), and analyzed essentially as described 
previously (38). To aid in the separation of small polypep- 
tides, sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis (SDS-PAGE) was performed with the addition of 32 
mM NaCI in the analytical gel (28, 30). Size estimations of 



polypeptides were performed by using an LMW calibration 
kit (no. 17-044601) and peptide molecular mass standards 
(no. 17-055101) from Pharmacia. 

Nucleotide sequence determinations. Nucleotide sequences 
were determined directly from plasmids by using a Pharma- 
cia DNA sequencing kit (no. 27-168201). To determine the 
nucleotide sequences of the previously unsequenced regions 
of the operon, subfragments were cloned into the polyclcn- 
ing site of a pBluescript SK(+) sequencing vector (Strata- 
gene). Ordered deletion libraries of the resulting plasmids 
were generated by using exonuclease III and mung bean 
nuclease essen : ally as described in the Stratagene Exo/ 
Mung DNA sequencing manual. Both strands were se- 
quenced and the junctions of fragments were confirmed by 
using custom-designed oligonucleotides. 

Chemicals. All chemicals used for the preparation of 
buffers were reagent grade or better. All coenzymes were 
obtained from either Sigma Chemical Co. or Boehringer 
Mannheim. Solutions of 2-oxopent-4-enoic acid were syn- 
thesized from L-ailylglycine essentially as described by 
Collinsworth et al. (12). 4-Hydroxy-2-oxo valerate was pre- 
pared by mild alkaline hydrolysis of 4-methyl-2-oxobutyro- 
lactone (14). 4-Oxalocrotonate was a gift from Peter 
Williams (University of Wales, Bangor). Equilibrium mix- 



714 SHINGLER ET AL. 

tures of the keto and enol forms of this compound were 
prepared by mixing an ethanolic solution of 4-oxalocrotonate 
with 50 mM Tris-HCl (pH 7.4) and allowing the solution to 
stand at room temperature for 1 h before use (20). 

L-(S)-4-Methyl-2-oxobutyrolactone was synthesized es- 
sentially as described by Burlingame and Chapman (8, 9 by 
usine treated crude extracts of phenol-grown P . putida U (9, 
14) or Pseudomonas sp. strain CF600 to accumulate the 
lactone from catechol. Rather than us ng continuous extrac- 
tion with diethyl ether, we used acidified reaction mixtures 
saturated with sodium chloride and then extracted four times 
with equal volumes of ethyl acetate. Yields from the s dif- 
ferent reaction mixtures were approximately 50% of the 
theoretical yield. The compound synthesized from F P ut J^ 
U was previously characterized as the l-(S) isomer (9). The 
compounds obtained with P. putida U or Pseudomonas sp. 
strain CF600 extracts were indistinguishible on the basis ot 
melting point (70 to 73°C) or circular dichroism measure- 
ments. This establishes the stereochemistry of 4-hyc^oxy-2- 
oxovalerafc* formed in Pseudomonas sp. strain CF600 as 

L-(S) 

Crude extract preparation. Cells used for making crude 
extracts were harvested, washed twice with cold 10 mM 
Na + -K + phosphate buffer (pH 7.5), and then used immedi- 
ately or stored at -70°C as a frozen paste. In preparation for 
sonicalion, cells were resuspended in either 50 mM Na -K 
phosphate buffer (pH 7.5) or 50 mM Tns-HCl (pH 7.4) buffer 
containing 2 mM dithiothreitol. Crude extracts were pre- 
pared by sonication of cell suspensions and then centnfuga- 
tion at 83,000 x g for 1 h. The supernatants were kept on ice 
and used within 30 h of preparation. 

Estimation of pruiein concentration. The protein concen- 
trations in ceil extracts were estimated by using a BCA 
(bicinchoninic acid) assay (Pierce Chemical Co.) with bovine 
serum albumin as the standard. The protocol for 60 C 
described in the manufacturer's instructions was used, with 
a modification (7) to eliminate interference from the dithio- 
threitol in the crude extracts. 

Enzyme activity assays. All enzyme assays were performed 
at 25°C except for the acetaldehyde dehydrogenase (acylat- 
ing) assay, which was carried out at 20°C. 

4-Oxalocrotonate isomerase was assayed by measuring 
the decrease in A 29S . The reaction conditions were essen- 
tially as described previously (36), except that the assay was 
conducted in 10 mM Na + -K + phosphate buffer (pH 7.4) (0.5 
ml). One unit of enzyme activity is defined as the amount ot 
enzyme required to cause an A 29 < decrease of 1.0 per min in 
a 3-ml reaction volume (36). 

4-Oxalocrotonate decarboxylase activity was measured by 
determining the amount of substrate remaining, from the 
A 350 after the reaction mixtures v ere quenched with NaOH- 
EDTA (18), at 15-s intervals over a 1-min period. The 
reaction mixtures (0.5 ml) contained 50 mM Tris-HCi (pH 
7.4), 2.7 mM MgS0 4 , and 125 p.M 4-oxalocrotonate (equi- 
librium keto-enol mixture). To ensure that the formation of 
the keto form of 4-oxalocrotonate was not rate limiting in the 
later part of the reaction, 5.5 U of 4-oxalocrotonate isomer- 
ase, obtained by heat treatment (20, 36) of extracts from cells 
harboring pVI314A, was added to the assays with extracts 
that lacked this activity. One unit of activity is defined as the 
amount of enzyme required to catalyze the conversion of 1 
jtmol of substrate per min. 

2-Oxopent-4-enoate hydratase activity was determined by 
the method of Collinsworth et al. (12), except that the buffer 
used was 45 mM Tris-HCl (pH 7.2), and 2.9 mM MgS0 4 was 
used in place of MnCI 2 (20). As defined by Collinsworth et al. 
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(12), 1 U of activity is the amount of rnzyme required to 
cause a decrease in A 265 1.0 per min in a 3-ml reaction 
volume. 

The assay for 4-hydroxy 2-oxovalerate aldolase was car- 
ried out by monitoring the oxidation of N ADH (A^ in th 
presence of excess lactate dehydrogenase, which reduces 
pyruvate formed by the action of the aldolase (36). Reaction 
mixtures (1 ml) contained 46 mM N-2-hydroxyethylpipera- 
zine-N'-2-ethanesulfonic acid buffer (pH 8.0), 1 mM MnCl 2 , 
260 \iM NADH, 260 \lM 4-hydroxy-2-oxovalerate, 28 V of 
lactate dehydrogenase, and 0.4 to 0.5 mg of crude extract 
protein. The reaction was initiated by the addition of 4-hy- 
droxy-2-oxovalerate, and rates were corrected for the pres- 
ence of NADH oxidase activity. Pseudomonas extracts 
contained high levels of alcohol dehydrogenase activity, so 
that in the absence of lactate dehydrogenase, acetaldehyde 
formed by the aldolase could then be reduced by this 
enzyme. Thus, in phenol-grown Pseudomonas extracts in 
the absence of lactate dehydrogenase, NADH was con- 
sumed at approximate^ 50% of the rate observed in the 
presence of lactate dehydrogenase. In contrast, the alcohol 
dehydrogenase activity in £. coli extracts was very low, so 
that, in the absence of lactate dehydrogenase, extracts that 
exhibited aldolase activity did so at a rate of only 15 to 20% 
of that observed in the presence of lactate dehydrogenase. 
One unit of activity is defined as the r mount of enzyme 
required to catalyze the oxidation of 1 p.mo\ of NADH per 
min in the presence of excess lactate dehydrogenase. 

The standard assay adopted for measurement of acetalde- 
hyde dehydrogenase (acylating) activity involved monitoring 
the coenzyme A-stimulated reduction of NAD (A^. As- 
say mixtures (1 ml) contained 50 mM Na + -K + phosphate 
buffer buffer (pH 7.5), 285 *iM NAD + , 10 mM acetaldehyde, 
and 0.2 to 0.4 mg of crude extract protein. The A 340 of this 
mixture was monitored for 1 min, and the reaction was then 
initiated by *he addition of coenzyme A (10G jtM). One unit 
of activity is defined as the amount of enzyme required to 
reduce 1 *mcl of N AD + per n ;n in the presence of coen- 
zyme A under these conditions. 

Nucleotide sequence accession numbers. The nucleotide 
sequence data in this paper have been submitted to the 
EMBL data library under accession numbers X60835 and 
X60836. 

RESULTS 

Mapping of the phenol/3,4-dmp operon. The plasmid-en- 
coded phenol/3,4-dmp catabolic pathway of Pseudomonas 
sp. strain CF600 involves a multicomponent phenol hydrox- 
ylase and a subsequenv /ne/a-cleavage pathway (Fig. 1). The 
genes for the enzymes of this pathway was previously found 
to be clustered on an approximately 20-ko SacUo^BamHl 
fragment (3). The nucleotide sequence of the genes involved 
in the first four biochemical steps of the pathway , dmpK, 
M, N, O, P. B, C and Z>, was previously determined, 
allowing unambiguous location within the 20-kb Sacl-to- 
Bam\\\ fragment (Fig. 2). 

Expression of the 20-kb SacUo-BamHl fragment confers 
on pseudomonads the ability to grow on phenol ^and its 
methylated derivatives m-, o-, and p-cresoi and 3,4-dmp. 
The location of dmpK. U M, AT, O, P. B, C, and D within 
this fragment suggests that the remaining genes of the 
pathway are encoded downstream of dmpD. To turtner 
define the location of the pathway genes, M^**™*' 
lives of pV*265, which expresses most of the 20-kb Sacl-to- 
BamHl fragment, were constructed and tested for their 
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FIG. 2. Restriction map of cloned pVH50 DNA. Arrowheads indicate the direction of transcription from the far promoter of the 
expression vector. The location of previously sequenced phenol catabolic genes are shown. Coordinates of restriction endonuclease 
recognition sites are given in kilobases. B, BamHU Bg, BgllU E. EcoRU Nr, Nru\: Pv. PvulU S, Sad The ability of P. putida PB2701 
harboring the plasmids to grow on M9 salts (3) supplemented with different carbon sources and IPTG (0.5 mM) is indicated. 



ability to confer growth on phenol and methylphenols. The 
results of these experiments (Fig. 2) demonstrate that a 
13.25-kb Bglil-Xo-PvuU fragment is sufficient to encode the 
entire phenol/3 ,4-dmp catabolic pathway. 

Nucleotide sequence of the phenol/3,4-dmp genes. The 
complete nucleotide sequence of the 13.25-kb Bglll-to-PvuM 
fragment was determined as described in Materials and 
Methods. The sense strands of the previously unsequenced 
regions are shown in Fig. 3, along with translation of six 
open reading frames (ORFs) that are preceded by putative 
ribosome binding sites. The location, size, intergenic spac- 
ing, and G+C contents of these ORFs relative to those of 
known genes of the pathway are summarized in Table 2. The 
ORFs, designated dmpQ* £, F, G, tf, and /, have high G+C 
contents because of preferential usage of G and C at the third 
codon position, as has been found for the other genes of the 
pathway and for Pseudomonas genes in general (30, 48). 

The dmpQ gene is located between dmpP and dmpB, 
whereas dmpE, F, G, //, and / are located downstream of 
dmpD. Thus, the coding order of the operon is dmpKLM 
NOPQBCDEFGHI, and the intergenic spacing varies be- 
tween 0 and 70 nucleotides. At the dmpQ-dmpB and dmpG- 
dmpH junctions, the last base of the last codon of one gene 
is the firs* base of the ATG start of the next gene. The very 
tight clustering of the genes suggests that the 15 genes 
comprise a single operon. This is consistent with previous 
data where a transposon insert in dmpP inactivated the 
expression of downstream genes (40). 

The nucleotide sequence was also analyzed for potential 
regulatory signals. A putative promoter region, 30 bp up- 
stream from the transcriptional start of dmpK, was previ- 
ously reported (30). This promoter region has strong homol- 
ogy to the nif-Wkt promoters that have been shown to be 
regulated by the alternative a 54 factor (15, 25). No other 
regions with homology to either the £. coli -35 TTGACA 
-10 TATAAT cr the m/-like -24 TGGC -12 TTGCT 
consensus sequences were found within the entire coding 
region. Likewise, no sequence similar to terminator regions 
(32) was found in the coding region or downstream of dmpl* 
the last gene of the operon. However, the nucleotide se- 
quence downstream of dmpl has comparatively low G+C 
content, has A and T tracks, and shows no preferential 
codon usage in any of the three reading frames. Further- 
more, the expression of this entire downstream region does 
not result in the production of detectable protein products 
(see below). These features suggest that this is noncoding 
DNA, but it remains to be determined how and where 
transcription terminates. 



Polypeptide product' if the dmp operon. The predicted and 
observed sizes of the polypeptide products of dmpK, L, M, 
Af, O, P. B, C, and D were previously determined (Table 2). 
To investigate the size of the gene products of dmpQ, £, F, 
(7, f/, and /, a series of plasmids expressing different regions 
of the DNA spanning the operon were constructed by using 
the pMMB66EH A and pMMB66HEA tec expression vectors 
(Fig. 4A). To visualize polypeptide products, the plasmids 
were introduced into the maxicell strain CSR603, and the 
plasmid-encoded polypeptides were analyzed by SDS- 
PAGE (Fig. 4B). 

Plasmid pVI300A contains DNA spanning dmpQ and part 
cfdmpB. From the nucleotide sequence, this plasmid would 
be predicted to express two polypeptides, one of 12.2 kDa 
corresponding to the product of dmpQ and one of 25.0 kDa 
comprising the first 162 amino arid residues of the dmpB 
gene product fused to a sequence of 67 unrelated amino acid 
residues. \ VI300A mediates production of two novel poly- 
peptides, AB (23.0 kDa) and Q (12.0 kDa), of approximately 
the predicted molecular mass (Fig. 4B ( lane 1). Plasmids 
pVI301A through pVI307A (Fig. 4B, lanes 4 through 10, 
respectively) contain overlapping regions of the opeion and 
mediate production of different combinations of six novel 
polypeptides: D (30.0 kDa), E (28.0 kDa), F (35.0 kDa), G 
(39.0 kDa), H (28.5 kDa), and I (6.7 kDa). Polypeptide D 
(30.0 kDa) was previously shown to be the product of dmpD 
(3). Comparison of the DNA contents and polypeptides 
produced from each plasmid indicates that the polypeptides 
are encoded in the order D-E-F-(G or HHG or HM. Plas- 
mids pVI308A through pVI314A (Fig. 4B, lanes 11 through 
16, respectively) were used to map the relative order and 
location of the coding regions for the polypeptides more 
precisely. These plasmids express DNA spanning each of 
the genes dmpD* E. F, C, f/, and / in turn. Each plasmid 
independently expresses a polypeptide indistinguishable on 
the basis of size from polypeptide D, E, F, G, H, or I. Some 
of these plasmids express the individual polypeptides at 
levels lower than those of plasmids expressing more than 
one gene; consequently, vector-encoded proteins can be 
seen more clearly in these lanes. The apparently decreased 
level of expression of the polypeptides from these plasmids 
might be the result of lower transcriptional levels because of 
promoter location and/or decreased translation due to 
mRNA stability or disruption of coupled transcription-trans- 
lation that can occur on polycistronic mRNAs. Neverthe- 
less, the results clearly demonstrate the relative coding order 
D-E-F-G-H-I and define the coding regions as follows: D, 
between Hpa\ and Sad (bp 8047 through 9070); E, between 
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4 -,«Av*t. 1*37764 GCCCTCTTCA AGCOJATCTG 5410 
dspOXSOV sequences 1 - 5488 bp accession mnfaer M37764 <fapP K R I * 

J^L^^^f^TffTrfHr.Tff 5550 

. , , nd YS2805 GGACGCCGCT GCCKmOCT 9030 

*#Ba> sequence. 5716 - 9102 bp accession nunfcere H33263 and X52805 ....^ A I S 

G&GAGAGACG AAAAYGGACft AGATJTH5AT CWkCGBGCTC GGCGACGBGC TGTROCAGGC AATGCTCMT CGCGAGGCGG TCSGCTOGCT 9120 



.xLLacaLc^o^ 
<J^aLLLL^ 

TOO V K A GD»» KVi/ * _ 

- -- ».m»wv- rv-scrfiACCT GATGATCAAG GTGCTGCGCA ACUOCAAGX& 

GOCATGWC CAGMACTCA AAGTCGCGAT CATCGOTCG GCRCC5ACCT GATGATGPALi v l r „ A K y 

M N QKL KVAI I G S — «— 



^ ««» «»«. «-r?rr. , rpfrrrfrr5^ 

V H Y A K 1 V * S __ TCATCATGAA CCCGQCIGAG CCGCOJCTGA TCATOCQCGA 10470 

CA3CJUUUS0C ATCGAAGTGA TOGBIQBIQC GGCCAMSGQC AAGGCQttCA TCATCATGRA w £ » p j, I M R D 

c k A I E V I G A A K G K A a a a m » 

~.«««rv. inHMM CGSTGGCGGA AATGGTTCAG GCGGTGCAGG CCTACGTQCC 10560 

CMOBIGM GIGCIGIO0B CCOCCGCOGA TCAOGCCGOC ™*«^ a M V0 A V Q AYVP 

_ v - vLS A A A D OA* V A A s v » » 

^^.__ rr n-awHGM CATCCOCGCT CTCGGCCGCT TCAGCGGGTT 10650 

OGGCXATOGC CIGAAGCAGC AGGK3QMSIT CGJGCTGATC COOGASIO0G OQMGCTGMk CflTCOOCGCT ^ ^ r f s g L 

sLLL l^oc LL^ cccc^t tacc^, cc^xcc,* CAjaaaac rrfr 10.0 

nvTLRDGSHAIR»W* A * J *' 

D ^.^-™«r- mhsctCBIC TTTCAACTAT GGCTTCGGAC GGCACACTGA CCTGGAGIAC 11010 

AAMTCGACA GCMCGMGT a^^* 00 ^ ^^^J^ TTTCAWXI Ai q f Q R g T 0 LEY 

their one-leticr code, and the asterisks indicate stop codons. 



Vol. 174. 1992 



PSEUDOMONAS SP. STRAIN CF600 PHENOL CATABOLIC OPERON 717 



ATCGA6GCGG TGQOCGGTGA GATCAGCCAC GCGCAAATTG CCACCTTGCT GCTGCCGGGG AXCGGCAGCG TG CA TGACCT GAAGAACSCC 11100 
ISA VAGE ISA A Q I ATX.L LP6 IGS VHDL XHA 

TATCAGGCTG QC GO OC GA GT GCCACOCATT GCACOGAAGC CGAOGXCTCG AAGCAGCACA TCGAGTACGC OQGCMOCTT 11190 

YQA GARY V R V ATH CTEA DVS K U H I E Y A RHL 

GGCAIGGAIA CCGTCGGCTT OCTGRTGATG AGCCACATGA TOOOGQOCGA GAAACTGGCC GAGCAAGGCA AGTTCATGGA GAGCTAOGGC 11280 
GMD TVGF LHH SHU I ? A E KLA KQ<? KLME 3 Y G 

GCGAOCTGCA TCTACATGGC CGACTCCGGT GGGGCGATGA GCATGAAOGA CAITOGTGAT CGGA7 GCGCG OGTTGAAGGC CGTGCTCAAG 11370 
A T C IYMA DSG GAM SMND I R D R M R AFKA VLR 

CCCGAAACGC AGGTCGGTAT GCACGCGCAC CACAAOCTCA UUafTUUUtfl AGCCAACICX ATTG.TGCCG TGGAAGAAGG CTGCGACCGT 11460 
PET QVGM BAH HNL SLGV ANS I *i A VEBG CDR 

GTGGACGOCA GCCIGGOCGG CATGGGCGOC GGTGCCGGCA ACGCACCACT GGAAGTGTTC ATOG^ r TG CAGAAOGGTT GGGCIGGAAC 11550 
VDA SLAG MGA GAG NAP L EVF I A / AERL GHN 

CATGGCACOG ACCTCTACAC OCTGATGGAT GO0G0O GA OG ACAXCGTCCG OCCGTTGCAG GATCGCCCJG TGOGGGTOGA OOGOGAGAOC 11640 
HGT DLYT LMD A A D DIVR PLQ DRP VRVD RET 

CTCGGCCTCG GATATGOOGG TG K TATTOC AGCTTOCTGC GTCAOGOOGA GATCCCGGCC GOCAAAXACA AOCIGAAAAC CCTOGACATT 11730 
L G L GYAG VYS SFL RHAE Z A A AKY N L K T L D I 

CTOGTOGAAC TGGGACAGOG GOGCATGGTC GGOGGOCAGG AAGACATGAT OGTGGACGTC GCCCTOGACC TGTTGGCGGC CCACAAGGAG 11320 
LVE LGHR RHV GGQ EDMI VDV ALO L L A A HRB 

AAOOGGGCAT GAAOCGCACT CTGAOOCGTG AOCAGGTGCT GGCCCTGGCC GAACACATGG AAAATGOOGA ACTCGATGTG CATGACATOC 11910 
N R A * 

cbpfl M NRT I* T R DQVL ALA EHI E H A E LDV HDZ 

OCAAGGTGAC CAAOGATTAC CCGTTJATGA CCTTTGCOGA CGCCTACGAC GTGCAGTGGG AAATOGGTOG GGGCAAGGAA GOOOGTGGTA 12000 
PKVT NDY PDM T F A D A Y 0 V Q W EIRR RRF ARG 

ACAAGGTGGT CGGTCTGAAG ATGGGOCTGA CGTOCIGGGC CAAAAXGGCG CAGATGGGCG TGGAAACACC GATCIAOGGC TJ OC T1 V 1 U* 12090 
N K V V GLK MGL T S W A X M A QMG V E T P IYG FLV 

ACIACTTGAG OGTGOOGGAT GGOGGTGXGG TGGATACCTC GAAGCTGATC CATCCGAAGA TOGAGGCAGA AAICAGCTTC GTCACCAAGG 12180 
DYFS VPD GGV VDTS K L I HPK ZEAE ZSF VTK 

CTCGATTGCA CGGCCCTGGC TGCCATA7CG GCCAGGTGCT GGCGGCXAGC GACITCGiGA TKXGACAGT OGAAGZGaXC GACTCACGTT 12270 
APLH GPG CHI GQVL A A T DFV IPTV EVI OSR 

ATGAGAACTT CAAGTTOGAC CTGATCAGOG TGGTGGOOGA CAACGCATCG TOGACCCGTT TGATCAOOGG TGGGCAGATG GOCAAOGTGG 12360 
YENF KFD LIS VVAD NAS STR FZTG GQM A N V 

OGGATCTGGA TCTGCGCACA CTCGGCGTGG TGAXGGAAAA GAACGGCGAA GTGGXAGAAC TOGGOGOCGG TGCOGCAGTG CTTGGCCATC 12450 
ADLD LRT LGV VHEX MGE VVE LGAG AAV LGH 

CGGCTTCCAG OGTGGOGATG TTGGCCAATC TGGTGGOOGA GCGTGGTGAG CATATCCCCG CGGGCAGCTT CATCATGACT GG09GCAICA 12540 
PASS V A M LAN LLAB RGB HIP AGSF IMT GGZ 

CCGCTGCGGT ACCGGTGGCA COGGGCGACA ACATCAOGGT GCGCTATCAG GGCCTTTGCT CGGTGAGTGC GCGCTTCAIC TAACCCTGTG 12630 
T A A V PVA PGD HITV RYQ GLC SVSA R F I * 

TGGCCGGCGT GCGCGCCGGC GKTTGTCGAC TAGGAGTACT GCAATGCCGA TTGCTCAGCT TTACATCATC GAAGGTC3T\ CCGACGAGCA 12720 

dspl HP I A Q L YII EGR TOEU 

GAAGGAAAOC TTGATCCGCC AAGICAGCGA AGCGATGGCG AACTCGCTGG ATGCACCCCT CGAGCGGGTG CGTGTGCIC A TTACCGAAAT 12810 
KET LIR QVSE A M A NSL DAPL ERV RVL ZTEN 

GCCGAAGAAT CACTTCGGCA TTGGTGGOGA GCCGGCAAGC AAGGTCAGGC GCTAGTATGA GAGTGTTC3T TAAAGAGTTT GATTTGAAIC 12900 
PKH HFG IGGB PAS KVR R* 

CCAGAGCGGG TTGATAGGCA GAAAGAGCT A CTULTCTUTT T CGCCGCTCA AA CGCAAA GT GCC CCT C I G T GGCAGGAGGC ACTTTTTAGA 12990 
TGAA aCtiMi TATTTAGOTT AGTCCTCGAC AIGCAGGCCG CACTCGGCCC GCATCTCATT AICAIGZOGA AGCTGATCGG GAAATTTGGG 13080 
TTTTOCICTT O UA TTTUU/T GACCATGGCA TCAAAGCAAG CCTCGGTGCC AXAGCCTGAC ACAIGGTTGG AGCTGCTCTT TGTATAGAGA 13170 
GCATGGCCTT GCTGAGGGCC TACAICGCTG CGGTAAGCCG l^lllTia (j CTA C I 'O' I 'CU AAAICCAGGT AOCAACCTGA GTOCATXAGG 13260 
GAAGGTCTGA AGTAQCCCTG TATTTCOGGT CGACTTCAGC OCYCGCTGCT TTTGAAAGCG GGCCGTCGAT CAAATTCGAC CAGGTAACCC 13350 
GCTCAGTTCT CCGTTTTTGG CCGCCGCGAG CAOCTCGCAG CAGCCATTTT CCGCATTACA GGTCCAOCTT TCCTGCGGTA GTCAGCAAAT 13440 
GTCGGCGCGC CATCCATAGG TTCGACAGCG CGAACAGCGT CACCAGCTG 13489 

FIG. 3— Continued. 



Ddel and EcoRl (bp 9033 through 10051); F, between Sail 
and Sail (bp 9790 through 10929); G, between Satd and 
BstEll (bp 10785 through 11921); H, between Notl and Xhol 
(bp 11702 through 12784); and I, between Seal and Pv ll (bp 
12670 through 13489). Hence, the sizes and order of poly 
peptides correlate well with the sizes and order of the genes 
(Table 2). 

The plasmids pVI302A and pVI305A through pVI307A 
carry DNA in addition to coding regions. The additional 
DN A ranges from approximately 0.63 to 7.04 kb downstream 
from the end of dmpL However, these plasmids do not 
mediate the production of any polypeptides other than those 
defined above (Fig. 4B, lanes 5, 8, 9, and 10). This observa- 
tion suggests either that transcriptional termination occurs 



s*.»on after the end of the dmpl coding region or that this is a 
noncoding region. 

Lower mito-cleavage enzyme activities. The four known 
activities of the /new-cleavage pathway that have not previ- 
ously been assigned to dmp gene products are 4-oxalocroto- 
nate isomerase, 4-oxaiocrotonate decarboxylase, 2-oxopent- 
4-enoate hydratase, and 4-hydroxy-2-oxovalerate aldolase. 
Crude extracts from cells harboring pVI265 and pVI299, 
which express the complete pathway from the IPTG-induc- 
ible tac promoter, contain approximately 25 to 50% of the 
levels of these enzymes found in crude extracts from phenol- 
grown wild-type strain CF600 (Table 3). Crude extracts from 
a vector-bearing control strain showed only basal levels of 
these enzyme activities, similar to those found for the 
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TABLE 2. Summary of the genes and gene products of the phenol/3 ,4-dmp catabolic operon 



J. Bacteriol. 



Gene 


Coordinates 
(bp) 


Intergenic 
spacing" 
(bp) 


dmpK 


745-1020 


55 


dmpL 


; 076-2068 


6 


dmpM 


2075-2344 


14 


dmpA 


2359-3909 


70 


dmpO 


3980-4336 


13 


dmpP 


4350-5408 


11 


dmpQ 


5420-5755 


0 


dmpB 


5755-6675 


37 


dmpC 


6713-8170 


10 


dmpD 


8181-9029 


14 


Qftiph* 




i e 


dmpF 


9845-10780 


14 


dmpG 


10795-11829 


0 


dmpH 


11829-12620 


53 


dmpi 


12674-12862 





Putative ribosome 
binding sites'' 



in ORF 



No. of 
amino acid 



Molecular mass 
ikDa) 



residues Predic(ed Esljmaled 



Function 



Reference 
or source 



CGCAAGCCGCCAACCTSaSfiATt; 59.4 

Met 

MCAASAfiGGTACGGTTGATATG 63.4 

Met 

AAAGCCGCSES&ATAAAGCATG 60.7 

Met 

AAGAACTAfiGAGACAAGCTCATG 59.4 

Met 

AAGAAC AAfiAGfiTTTCG ATCATG 61 .8 

Met 

GTGCAGCTGAGAGGTGTGTCATG 65.6 

Met 

AAGCGCATCTfiA&iTGAACCATG 64.5 

Met 

AACCAGCAAGAGGTGTCGTCATG 61.0 
Met 

TTTTTGC AS5EATTGCGCAGATG 66. i 

Met 

CGTGAAGTTGTfiAfiGCAGCCATG 66.7 

Met 

ATTTCCTfiAGAGAGACGAAAATG 64.2 

Met 

CCTAAATT GGAG GCTTGCAfiATfl 65.3 

— _ — Met 

aacgccIgSSESSctgcaccatg 61.9 

Met 

GCCCACAAEGAGAACCGCGCATG 59.9 

Met 

TGTCGACTSngSgTACTGCAATG 57.6 
Met 



92 


10.6 


12.5 


Unknown 


30, 33 


331 


38.2 


34.0 


Phenol hydroxylase com- 


30, 33 


90 






ponent 




10.5 


10.0 


riiciiui fiyuruxyiase coin* 


30, 33 


517 






nnnpnf 




60.5 


58.0 


rn^noi nyaroxyiase com* 


30, 33 


119 






ponent 




13.2 


IJ.V 


Phenol hydroxylase com- 


30, 33 


353 






ponent 




38.5 


39.0 


rncnui nyuroxyiase com- 


30, 33 








pone n i 




112 


12.2 




Unknown 


This study 


307 


35.2 


11 ft 


Catechol 2.3-dioxygenase 


3,4 


486 


51.7 


50.0 


--nyuroxymuconiC sennaj* 


3, 31 


283 


31.0 




dehyde dehydrogenase 




3(\0 


2-Hydroxymuconic semial- 


3, 31 


261 






dehyde hydrolase 




27.9 


28.0 


2-Oxopent-4-dienoate hy- 


This study 








dratase 


312 


32.7 


35.0 


Acetaldehyde dehydroge- 


This study 


345 






nase (acylating) 


37.5 


39.0 


4-H ydrox y-2-oxo valerate 


This study 








aldolase 


264 


28.4 


28.5 


4-Oxalocrotonate decar- 


This study 


63 






boxylase 


7.1 


6.7 


4-Oxalocrotonate isomer- 


This study 








ase 





Over- and underlined bases indicate sequences complementary to the 3' ends of the 16S rRNAs of £. voli and />. tierttf>inosa t respectively (39). 



uninduced acetate-grown wild-type CF600 strain (Table 3). 
The background levels of 4-hydroxy-2-oxovalerate aldolase 
activities are relatively high. This is consistent with previous 
work, in which similar noninduced levels were observed in 
£. coli strains (3) and in certain Pseudomonas strains (37), 
Measurements of lower m?/a-cleavage pathway enzyme 
activities in E. coli strains expressing different combinations 
oidmpE, F, G, H, and / were used to correlate these genes 
with enzyme function. 

dmpi encodes 4-oxalocrotonate isomerase activity. The 
dmpi gene was found to encode 4-oxalocrotonate isomerase 
activity. Crude extracts from a strain harboring pVI314A, 
expressing dmpi alone, exhibited this activity (Table 3), 
albeit at a level significantly lower titan that or extracts from 
strains expressing the whole operon. Crude extracts from a 
strain harboring pVI303A. which encodes dmpDEFGH but 
not dmpi, lack this activity (Table 3). 

dmpE encodes 2-oxopcnt-4-enoate hydratase activity. 2-Oxo- 
pent-4-enoate hydratase activity was found to be encoded bv 
the dmpE gene, since crude extracts from a strain harboring 
pVI309A, expressing dmpE alone, contained this activity 
(Table 3). As observed for the isomerase, the hydratase 
activity when expressed in isolation was lower than that 
observed in crude extracts from strains expressing the 
complete pathway. Two possible explanations for this are 
lower expression levels and instability of the hydratase in the 
absence of another pathway enzyme(s). The specific activity 
of the hydratase from strains harboring pVI309A varied 
somewhat between experiments (twofold higher or lower 



than the activity shown in Table 1 ), suggesting the possibil- 
ity of some instability of this enzyme. Strains harboring 
pVI303A (dmpEFGH) and pV1315A (dmpEH) contain repro- 
ducibly high hydratase activity levels. In the maxicell exper- 
iments (Fig. 4B, lanes 6 and 12), pVI303A and pVI309A 
express the </wp£gene product at similar levels; hence, the 
level of hydratase activity observed in the strain harboring 
pVI309A is somewhat lower than expected based on the 
apparent expression levels. Taken together, these data may 
indicate that the hydratase is less stable and/or less active in 
the absence of the dmpH product (see below). However, 
conclusive experiments must await purification of the en- 
zymes. 

dmpH encodes 4-oxalocrotonate decarboxylase activity. One 

complication in determining which gene encodes 4-oxalocro- 
tonate decarboxylase is the lack of 4-oxalocrotonate decar- 
boxylase activity in the absence of the hydratase. The 
decarboxylases from the /we/a-cieavage pathways encoded 
by the TOL plasmid (20) and P. putida U (12) have been 
shown to be tightly associated with their respective hy- 
dratases; preliminary protein purification data from our 
laboratory suggest a similar tight association of the decar- 
boxylase and hydratase from Pseudomonas sp. strain CF600 
(unpublished observation). Attempts to express the TOL- 
encoded decarboxylase in isolation resulted in low or unde- 
tectable levels of activity (20), indicating the necessity for 
the presence of the hydratase for the detection of decarbox- 
ylase activity. 

No 4-oxalocrotonate decarboxylase activity was detected 
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from strains harboring plasmids expressing dmpE* F, G, //, 
or / in isolation (Table 3). However, decarboxylase activity 
was detected in all strains that coexpressed dmpH and dmpE 
(hydratase). Plasmid pVI315A expresses only dmpE and 
dmpHy and a strain harboring this plasmid has both hy- 
dratase (dmpE) and decarboxylase activities. Hence, dmpH 
is clearly associated with decarboxylase activity. No decar- 
boxylase activity was detected from strains expressing 
dmpH in isolation (pVI313A), even when vastly increased 
amounts of crude extract protein were used in the assay. The 
polypeptide levels observed from plasmids pVI303A and 
pVI313A in maxicell experiments (Fig. 4B, lanes 6 and 15) 
indicate that pVI313A expresses the decarboxylase at an 
approximately fivefold lower level than does pVI303A. How- 
ever, since strains harboring pVI303A exhibit decarboxylase 
activity that is approximately 100-fold higher than back- 
ground levels, we would expect to be able to detect decar- 
boxylase activity in strains harboring pV1313A, despite the 
fivefold lower expression level. Thus, we conclude that no 
decarboxylase activity was found in strains harboring 
PVI313A (dmpH) because of the absence of coexpression of 
the hydratase rather than because of low expression levels. 
Attempts to reconstitute decarboxylase activity by mixing 
extracts of strains expressing the dmpE and dmpH in isola- 
tion were unsuccessful (data not shown). This is consistent 
with the notion that these two gene products must be 
coexpressed to allow correct physical association. 

dmpG encodes 4-hydroxy-2-oxovalerate aldolase activity. In 
some bacteria that degrade aromatic compounds, 4-hydroxy- 
2-oxovalerate aldolase appears to be a noninducible enzyme 
(36). However, the results shown in Table 3 clearly demon- 
strate the inducible nature of this aldolase in Pseudomonas 
sp. strain CF600. The dmpE. F, and / genes encode snecific 



n ™* 4 * A " alysis of lhe Polypeptides produced from the dmpQ, 
D, £, F G, and / genes of the phenol/3 ,4-dmp operon. (A) The 
extent of the DNA in a plasmid series is shown with a summary of 
the polypeptides produced from the plasmids tested. Wavy lines 
indicate the locations of discontinuity in the scale. Coordinates of 
restriction endonuciease recognition sites are given in base pairs B 
BamHU Ba, Bat 31; ; D, Ddel; E, £coRI; H, Hpal; Nr, Nrul; W. 
Pvull; S, Sacl; Sa, Safl; Sc, Seal: Su, SauL Only the sites used in 
the construction of plasmids are shown. (B) Maxicell analysis of 
plasmid-encoded polypeptides. Polypeptides from CS4603 harbor- 
ing the plasmids indicated in panel A were separated by 15 to 20% 
acrylamide gradient SDS-PAGE with 32 mM NaCI in the analydcal 
gel. Lanes 6 and 12 have half samples loaded to aid visualization of 
individual bands. The molecular mass standards are shown in 
kilodaltons. AB (23.0 kDa), Q (12.0 kDa), D (31.0 kDa), E (28 0 
kDa), F (35.0 kDa), G (39.0 kDa), H (28.5 kDa), and I (6.7 kDa) 
indicate novel polypeptide bands encoded by the different plasmids. 
The relative molecular mass of polypeptide I was determined by 
using Pharmacia peptide standards (nc 17-055101; 17.21, 14.6, 8.24, 
6.38. and 2.56 kDa) (daia not shown). 



wf/o-cleavage pathway activities (see above). Therefore, 
the 4-hydroxy-2-oxovaIerate aldolase activity must be en- 
coded by either dmpF or dmpG. As discussed below, dmpF 
alone encodes acetaldehyde dehydrogenase (acylating), 
leaving dmpG as the obvious candidate to encode the 
aldolase. Strains harboring either pVI303A (dmpDEFGH) or 
PVI316A (dmpFG) exhibit high aldolase activity at similar 
levels, whereas extracts from pVI311A idmpF) contain no 
detectable aldolase activity. These results demonstrate the 
correlation between dmpG and the presence of aldolase 
activity. The expression of dmpG alone did not yield aldo- 
lase activity (pVI312A, Table 3). This result was surprising, 
on the basis of polypeptide levels expressed from pVI303A 
and pVI312A (Fig. 4B, lanes 6 and 14), which indicate a 
reduction of only 5- to 10-fold in polypeptide levels when 
dmpG is expressed on its own. Since pVI303A expresses 
aldolase activity at 15- to 20-fold above background, we 
would expect to detect aldolase activity in extracts from the 
pVI312A-bearing strain. Although we cannot completely rule 
out the possibility that low levels of expression from 
PVI312A may account for the lack of activity in this strain, 
an alternative explanation is possible. High aldolase activity 
in strains that coexpress dmpF and dmpG might indicate 
dependence of the aldolase activity on the presence of the 
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TABLE 3. Enzyme activities of strains expressing various genes 
of the phenol catabolic operon" 



Enzyme activity 



Extract 



Pseuaomonas sp. strain 
CF600 (wild type) 
Phenol grown 
Acetate grown 

£. coli DH5 
pMMB66EHA (vector 

control) 
pVI265 (whole operon) 
pVI299 (whole operon) 
pVI303A {dmpDEFGH) 
pV1309A (dmpE) 
pVI310A idmpF) 
pVI311A {dmpF) 
pVI312A {dmpG) 
pV1313A {dmpH) 
pVI314A idmpl) 
pVI315A (dmpEH) 
pVI316A (dmpFG) 



40! 


40D 


OEH 


HOA 


ADA 


(U/mg) (U/mg) (U/mg) (mU/mg) 


(mU/mg) 


560 


3.5 


42 


160 


61 fc (140) 


<1 


<0.02 


0.2 


7.6 


- (-) 


<1 


<0.02 


<0.01 


6.3 


- 


290 


0.62 


10 


39 


58 


310 


0.74 


11 


44 


59 


<1 


2.0 


18 


120 


79 


<1 


<0.02 


3.3 


8.4 




<1 


<0.02 


<0.01 


7.0 




<1 


<0.02 


<0.01 


6.4 


90 


<1 


<0.02 


<0.01 


5.0 




<1 


<0.02 


<0.0i 


6.1 




61 


<0.02 


<0.01 


6.5 




<1 


1.9 


28 


7.9 




<1 


<0.02 


<0.01 


104 


62 



a Enzyme assays were performed with crude extracts prepared as described 
in Materials and Methods. The figures represent the averages of duplicate 
determinations. Each experiment was performed a minimum of two times. -, 
not detectable. Enzyme abbreviations are as defined in the legend to Fig. 1. 

b The reaction rate decreased very rapidly. The enzyme activity with a 
10-fold-higher concentration of NAD + in the reaction mixture is given within 
parentheses. 



dmpF gene product, acetaldehyde dehydrogenase (acylat- 
ing). Experiments with a strain in which only dmpF has been 
deleted from the entire pathway suggest that this might be 
the case, since, in comparison with results from a strain 
expressing the complete pathway, both aldolase and acetal- 
dehyde dehydrogenase (acylating) activities are undetect- 
able, whereas all other lower pathway enzyme levels are 
essentially identical (unpublished data). 

dmpF encodes acetaldehyde dehydrogenase (acylating) ac- 
tivity. The fate of acetaldehyde (or propionaldehyde, in the 
case of 4-methyI-substituted catechols), which formed by the 
action of 4-hydroxy-2-oxovalerate aldolase, has not been 
rigorously determined. The presence of alcohol dehydroge- 
nase activity in phenol-grown P. putida U was noted (12), 
and this represents one possible mechanism for metabolizing 
the aldehyde formed by the action of the aldolase. Alcohol 
dehydrogenase activity was also found, by using acetalde- 
hyde or propionaldehyde as a substrate, in crude extracts 
from Pseudomonas sp. strain CF600 and was present at 
similar levels in extracts from cells grown at the expense of 
either phenol or acetate (data not shown). Although this 
noninducible enzyme may metabolize the aldehyde formed 
in the mera-pathway, an operon-encoded enzyme would 
provide a mechanism for greater control over the fate of this 
metabolite. 

The dmpF gene encodes a previously undescribed operon- 
encoded enzyme activity of the /new-cleavage pathway. A 
strain harboring pVI3UA, which encodes dmpF alone, ex- 
presses a coenzyme A-dependent acetaldehyde dehydroge- 
nase (Table 3); no activity was observed in the absence of 
coenzyme A (data not shown). This activity was also found 
in two other strains that harbor plasmids expressing dmpF 
together with other genes of the pathway (pVI303A and 
pVI316A, Table 3). No enzyme activity was detected in 



extracts from a strain harboring pVI310A, which also en- 
codes dmpF\ however, the expression levels from this 
plasmid are very low (Fig. 4B, compare lanes 6 and 13). At 
low levels, this enzyme would be difficult to detect because 
of the presence of any competing alcohol dehydrogenase 
activity. This was a particular problem when measuring 
coenzyme A-dependent acetaldehyde dehydrogenase activ- 
ity in extracts from Pseudomonas sp. strain CF600, which 
contained much higher levels of competing alcohol dehy- 
drogenase activity compared with the low levels observed in 
E. coli strains (data not shown). Despite this problem, 
inducible coenzyme A-stimulated acetaldehyde dehydroge- 
nase activity could be detected in phenol-grown but not 
acetate-grown wild-type CF600 cell extracts (Table 3). How- 
ever, NADH production tapered off to zero almost immedi- 
ately after the start of the reaction; although reaction time 
could be extended by rising a 10-fold higher NAD + concen- 
tration, NADH production was still markedly nonlinear. 
Obviously, NADH produced by acetaldehyde dehydroge- 
nase (acylating) can be consumed by the reduction of excess 
acetaldehyde by alcohol dehydrogenase present in the ex- 
tract. Such an event would account for the quick tapering off 
of reactions observed for phenol-grown extracts. 



DISCUSSION 

The entire phenol/3 ,4-dmp catabolic pathway of Pseudo- 
monas sp. strain CF600 is encoded in a single operon 
consisting of 15 genes, dmpKLMNOPQBCDEFGHl , with 
intergenic spacing that varies between 0 and 70 bp. The first 
six genes are involved in the conversion of phenol to 
catechol, and the remaining genes encode enzymes of the 
me/c-cleavage pathway. This genetic organization is dif- 
ferent than that found in two other phenol-degrading 
pseudomonads, P. putida U (5. 6) and P. pickettii PKOl 
(26), in which the enzymes foi conversion of phenol to 
catechol are encoded separately from their respective oper- 
ons encoding the weta-cleavage pathway enzymes. 

Computer-assisted data base searches with the six newly 
determined meta-cleavage enzyme sequences of dmpQ, £, 
F, G. A/, and / did not reveal any striking homology to any 
genes of known function. However, the dmpG gene, which 
encodes 4-hydroxy-2-oxovalerate aldolase activity, showed 
approximately 20% identity over the entire length of the 
protein with the niJV gene product from various sources (1) 
and with a-isopropylmalate synthase from Salmonella typhi- 
murium (35). The niJV gene appears to encode homocitrate 
synthase, which catalyzes aldol condensation of acetyl co- 
enzyme A with a-ketoglutaric acid (22), whereas a-isopro- 
pylmalate synthase carries out condensation of acetyl coen- 
zyme A and a- ketoisocitrate (44). Despite the differences in 
substrate structure and specificity, these condensation reac- 
tions are mechanistically simply the reverse of the 4-hy- 
droxy-2-oxovalerate aldolase-catalyzed reaction. Thus, it is 
not surprising that these enzymes appear to be related. 

The dmpF gene product encodes acetaldehyde dehydrog- 
enase (acylating) activity. An enzyme (E.C.1.2.1.10) cata- 
lyzing this reaction was isolated from Clostridium kluyveri 
(10), but sequence information was not reported. The de- 
duced amino acid sequence of the dmpF gene exhibited a 
short region of homology with a number of dehydrogenases. 
The region of homology (Fig. 5A) coincides with a 0a0 fold 
fingerprint identified for ADP binding (47). The degree of 
agreement with the fingerprint (10 of 11 amino acid residues) 
and the requirement for the cofactor NAD + for enzymatic 
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A) 

dnpF 6 KVAIIGSGNIGTDLMIKVLRNAKYLEHGAMV 36 



<fopQ MNRAOTIRETVSGOJiTlCLPDQSVl^AMEOj}GKRCmG&G 55 
nahORF HS-EVFEITVQPCCERFV^K^ 54 
GTOJCHKHSCNHTOPEAAKO^l^ 112 
GDYESGRVSCKHUVEAREQGYAIACRU'ARSDLC 108 

C) 

dnipE MDKI L I NELC DELYQAMVNREAVS PLTERGLD 1 S VD DAYHISLRMLERflLAAGEKVlGKKIGV 63 

drcpH MNRTLTRDQVLA1AEH I ENAELDVHD I PKVTNDYPDMTFADAYDVQWE IRRRKEARGNIOTGLKMGL 67 
TSKAVQ^MliM^QPDFX^TDRMVFNSGEAHPISQLXMQPKAEGEVAFI LKKDLIGFGVTNADVLA 129 
TSWAKMAQHGVETPIYCFLVDYFSVPDOT 132 
ATECTOTCmVDSRIRDWUKIQDTVAD!^ 195 
ATDFV I PTVEVI DSRYENFKFDLI SWADHAS STRF2TGGQHANVADU)I£11jGV\nffi^GE^^L 198 
GAGAAAlXSSPVNCVAWUmTjGRFGIADCAGEVII^GSLVPI^ 261 
GAGAAVLCHPASSVAHIANLLAERGCHi PAGSFIHT^I TAAV^AP^IQT^YQGLCSVSAI^ 264 

FIG. 5. Primary amino acid sequence comparisons. Dashed lines indicate the locations and sizes of gaps inserted into the sequence. 
Identical amino acids are marked with colons, and chemically similar amino acids are marked with single dots. Amino acids considered to 
be similar are A, S, and T; D and E; N and Q: R and K; h L, M, and V; and F, Y, and W. (A) Comparison of the N-terminal region of the 
deduced amino acid sequence of the dmpF gene product and a (Jap fold fingerprint region identified for ADP binding (45). Symbols: A, basic 
or hydrophobic <K, R, H, S, T, Q, N); □, small and hydrophobic (A, I, L. V, M, C); O. acidic (V, M). (B) Comparison of the deduced amino 
acid sequence of the dmpQ gene product and an analogously located ORF in the naphthalene me fa-cleavage operon of NAH7 (47). Asterisks 
indicate cysteine residues with the same spacing as those found in [2Fe-2S] center-containing plant ferredoxins (43). Fifty-seven (52%) of 109 
residues were found to be identical. (C) Comparison of the deduced amino acid sequences of the dmpE and dmpH gene products: 97 (37%) 
of 264 residues were found to be identical. 



activity strongly suggest that this region participates in 
NAD + binding. 

Acetaldehyde dehydrogenase (acyiating) activity encoded 
by dmpF provides an operon-controlled mechanism for the 
metabolism of one of the products of the aldolase. The action 
of the aldolase results in the production of pyruvate, which 
can readily feed into the tricarboxylic acid cycle, and a 
short-chain aldehyde that requires further modification to 
produce a central metabolite. In the case of catechol or 
3-methy (catechol, acetaldehyde would be produced, 
whereas propioualdehyde would be generated by the action 
of /we/a-cleavage enzymes on 4-methycatechoI or 3,4-dime- 
thylcatechol. Since the dmpF-encoded enzyme can use both 
of these aldehydes as substrates (unpublished data), it may 
be responsible for their further metabolism. Metabolism via 
this route would be energetically favorable, since it results in 
the production of NADH and the formation of an acyl 
coenzyme A ester that can readily be metabolized further. 

A complicating factor in establishing the metabolic fate of 
acetaldehyde and propionaldehyde is the presence of non in- 
ducible, aldehyde-dependent NADH oxidation activity in 
crude extracts from Pseudomonas sp. strain CF600. It is 
most probable that this activity is due to alcohol dehydrog- 
enase (E.C.I. 1.1.1), whiui has also been shown to be 
present in crude extracts of phenol-degrading P. panda U 
(12). It is likely that this activity competes with the dmpf - 
encoded enzyme for any aldehyde that is formed, although 
to what extent it is important in the dissimilation of alde- 
hydes is not clear. Due to the high NAD + /NADH ratio 
generally present inside the cell (42), it is probable that 



reduction of acetaldehyde by alcohol dehydrogenase is ther- 
modynamically unfavorable, whereas this ratio would favor 
the acetaldehyde dehydrogenase (acyiating)-cataiyzed reac- 
tion. However, additional information regarding the kinetic 
constants of the two enzymes is required to assess their 
relative importance. Despite the possible competition be- 
tween these two aldehyde-metabolizing enzymes, the pres- 
ence of alcohol dehydrogenase activity in itself does not 
necessarily indicate that another route for aldehyde dissim- 
ilation exists. In the absence of other alcohol-metabolizing 
enzymes, the reversible nature of the alcohol dehydroge- 
nase-catalyzed reaction would mean that, as acyl coenzyme 
A is consumed, alcohol could be converted to the acyl 
coenzyme A derivative via the reaction catalyzed by the 
</mpF-encoded enzyme (10). In Results we also discuss the 
possibility that the coenzyme A-dependent dehydrogenase 
and the aldolase that produces the aldehyde are associated in 
some way. Such an association might represent a mechanism 
by which the aldehyde can be sequestered for metabolism 
via the dmpF gene product. Clearly, these reactions and the 
enzymes that catalyze them warrant more detailed studies. 

Detailed study of the organization and function of a 
number of catabolic pathways has revealed that the order 
and nucleotide sequence of mem-pathway genes from dif- 
ferent catabolic pathways can be highly conserved (2). This 
has ieu iu iiic tuea that the catabolic pathways rr.ay have 
evolved by inheritance of pre-evolved metabolic modules 
(11). Table 4 demonstrates the similarities between the sizes 
and organizations of the mefa-cleavage pathway genes of the 
dmp operon of pVI150 and those of the xyl genes of TOL 
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TABLE 4. Comparison of the organizations of the dnp and 
xyl meta cleavage pathway genes" 



Gene 



dmpQ 
dmpB 
dmpC 
dmpD 
dmpE 
dmpF 
dmpG 
dmpH 
dmp! 



Molecular 
mass of 
protein 

\KUaf 


Enzyme 
encoded 


Gene 


MOiecuiar 
mass of 
protein 
(kDa} 


12.0 


Unknown 


xxlT 


12.0 


32.0 


C230 


xxlE 


36.0 


50.0 


2HMSD 


xxlG 


60.0 


30.0 


2HMSH 


xxlF 


34.0 


28.0 


OEH 


xxU 


28.0 


35.0 


AIM 


xxlQ 


42.0 


39.0 


KOA 


xxtK 


39.0 


28.5 


40D 


xxl! 


27.0 


6.7 


40I 


xxlH 


4.0 



Enzyme 
encoded 



Unknown 

C230 

2HMSD 

2HMSH 

OEH 

Unknown 

HOA 

40D 

401 



a Molecular mass estimates and correlations of polypeptides to enzyme 
functions are from reference 19; enzyme abbreviations are as defined in the 
legend to Fig. 1. 



plasmid pWWO. The gene order of the naphthalene meta- 
cleavage pathway of NAH7 is the same except that the 
relative order of last two genes (encoding 40D and 401) is 
reversed (21, 46). The order of the xyl and dmp genes is 
identical, and the gene product sizes, as determined by 
relative mobility on gels, are similar (Table 4). The largest 
difference in gene product size is observed between dmpF\ 
which encodes an acetaldehyde dehydrogenase (acylating), 
and xylQ, an analogously located gene of unknown function 
within the xyl operon. Given the similarity of function of all 
the other me/a-cleavage pathway genes, it would appear 
Mkely, despite the apparent difference in size, that xylQ 
encodes an isofunctional enzyme. The products oidmpl and 
xylH, which both encode 401, also appear to be different in 
size. However, we believe that this may be primarily a 
reflection of the PAGE systems and standards used to 
determine the apparent molecular mass, since the dmpl 
product appears as a 3.5-kDa band on tiicine gels, but as a 
6.7-kDa band on 20% or 10 to 20% or 15 to 20% gradient 
SDS-PAGE gels (data not shown). 

The function of dmpQ is unknown, but coding regions of 
the same size and location are present in the me/a-cleavage 
pathway cperons of pWWO and NAH7 (Table 4) (19, 47). 
Comparison of the deduced amino acid sequences of dmpQ 



and the analogously located gene of NAH7 reveals 52% 
identity. The dmpQ gene product, like that of NAH7, has 
homology to ferredoxins and contains cysteine residues with 
the same spacing as the [2Fe-2S] center ligands in plant-type 
ferredoxins (Fig. 5B). These features suggest a possible role 
for the dmpQ product as an electron carrier for some step in 
the catabolism of phenol and/or its methyl-substituted deriv- 
atives. 

Sequence identity between the genes encoding C230 of the 
three operons, dmpB. xy/£, and nahH is between 83 and 87% 
on the amino acid level (4). This figure is significantly higher 
than that found for dmpQ. The results of data base searches 
with the sequences ofdmpC and dmpD have been discussed 
(31). Althou gh nucleotide sequence data for the isofunctional 
genes from ihe xyl and nah operons are not available in the 
data bases, our recent searches for homology to the dmpD 
gene product (2-hydroxymuconic semialdehyde hydrolase) 
identified partial ORFs upstream from the coding regions for 
the toluene dioxygenase genes of P. putida Fl (48) and 
benzene dioxygenase genes of P. putida (24). The degrees of 
identity found between the C-terminal portion of 2-hydroxy- 
muconic semialdehyde hydrolase and the deduced amino 
acid sequence of the 3' partial ORFs were 65.1 and 62.4% in 
a 175- and 101-amino acid overlap, respectively. Thus, 
although the catabolic pathways of pVI150, pWWO, and 
NAH7 may have inherited their wf/fl-cleavage genes as an 
ancestral metabolic module, in other cases, such as toluene 
catabolic pathway of P. putida Fl and the benzene catabolic 
pathway, this apparently has not occurred or the genes have 
subsequently been reshuffled. Alternatively, and more likely 
in view of the conserved sequence and gene order between 
these two catabolic pathways (48), they may represent 
divergence from an alternative metabolic module. 

The gene organization of the mera-cleavage pathway of 
Pseudomonas pickettii PKOl, which can grow at the ex- 
pense of toluene, benzene, cresols, and phenol, was recently 
described (26). The organization c f the wi^a-cleavage path- 
r.~«~c- A;fr<*r>c fw*m thit Hpcpi iKpH ahnvp for TQL and 

pVI150; the relative order of the genes encoding 2-hydroxy- 
muconic semialdehyde dehydrogenase and 2-hydroxymu- 
conic semialdehyde hydrolase is reversed, and the gene 
encoding 2-oxopent-4-enoate hydratase is located at the end 
of the operon. Some enzymes of this w^ra-cleavage pathway 
also appear to differ fro.rt those of TOL and pVU50 in that 



TABLE 5. Homology within the dmp operon: comparison of primary anvno acid sequences 



7c Identity" with: 



Gene 



dmpK dmpL dmpM dr V N dmpO dmpP dm„Q dmpB dmpC dmpD dmpE dmpF dmpG dmpH dmpl- 



dmpK 
dmpL 
dmpM 
dmpN 
dmpO 
dmpP 
dmpQ 
dmpB 
dmpC 
dmpD 
dmpE 
dmpF 
dmpG 
dmpH 
dmpl 



100.0 



7.6 
100.0 



10.0 
10.0 
100.0 



8.7 
5.4 
8.9 
100.0 



12.0 
8.4 
6.7 
8.4 
100.0 



10.9 
7.6 

12.2 
5.7 
7.6 
100.0 



4.3 
7.1 
8.9 
8.0 
8.9 
11.6 
100.0 



5.4 
7.8 
8.9 
6.2 
5.9 
6.5 
5.4 
100.0 



7.6 
5.4 
7.8 
6.8 
5.0 
9.1 
8.0 
5.9 
100.0 



5.4 
6.7 
7.8 
4.9 
9.2 
8 1 
9.8 
6.0 
6.7 
100.0 



5.4 
5.7 

13.3 
6.1 
8.4 
6.5 
7.1 
7.7 

11.1 
8.0 
100.0 



7.6 
7.1 
3.9 
7.1 
8.4 
7.4 

10.7 
5.5 
9.9 
9.2 
8.4 

100.0 



14.1 
7.6 
7.8 
7.2 
6.7 

10.4 
8.0 
7.2 
6.1 

10.6 
7.3 
7.4 
100.0 



7.6 
8.0 
10.0 
7.2 
6.7 
6.4 
5.4 
8.0 
6.8 
9.1 
30.3 
8.0 
9.1 
100.0 



U-l 
9.5 

12.7 
7.9 
6.4 

11.1 

12.7 
7.9 

14.3 
9.5 

11.1 

12.7 
6.4 

11.1 
100.0 



' Percent identity found by using the PC-GENE software program Palign with gap and unit gap costs of 100. 
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4-oxalocrotonate decarboxylase can apparently function in 
the absence of 2-oxopent-4-enoate hydratase (see below). 
These results suggest earlier divergence of this catabolic 
pathway. 

Csne duplication and subsequent divergence make up one 
mechanism by which different enzymatic activities might 
evolve. To investigate whether such a mechanism played a 
role in the formation of the dmp catabolic operon, the amino 
actd sequences of each of the 15 gene products of the 
pathway were compared. The PC-Gene software program 
Palign and the Wisconsin Genetics Computer Group soft- 
ware program Bestfit gave similar qualitative results. The 
results with the Palign program are shown in Table 5. 

From the results shown in Table 5, the only obvious case 
in which ancestral gene duplication may have resulted in 
divergence to two different enzymatic activities involves the 
products of dmpE and dmpH, which encode 2-oxopent-4- 
enoate hydratase and 4-oxalocrotonate decarboxylase, re- 
spectively. These two proteins, when optimally aligned, 
show 37% identity; the greatest homology is in the C-termi- 
nal portions of the sequences (Fig. 5C). As described in 
Results, a strain harboring a plasmid expressing dmpE alone 
exhibited 2-oxopent-4-enoate hydratase activity, whereas 
expression of dmpH in isolation did not yield 4-oxalocroto- 
nate decarboxylase activity. 4-Oxalocrotonate decarboxyl- 
ase activity was only detected when dmpE and dmpH were 
coexpressed in the same cell and could not be restored by 
mixing crude extracts from strains expressing each of the 
genes in isolation. These data are consistent with the findings 
of Haryama et al. (20). They found that the two isofunctional 
enzymes from the toluene catabolic pathway copurify 
through three chromatography steps and probably form a 
complex within the cell. Similarly, these workers also found 
that expression of the two proteins separately results in 
active 2-oxopent-4-enoate hydratase but loss of 4-oxaiocro- 
tonate decarboxylase activity. Their data are consistent with 
a tetrameric association of the two proteins in a complex. It 
is possible that homomultimers or monomers of 2-oxopent- 
4-enoaie hydratase retain activity, whereas heteromuitimers 
of 2-oxopent-4-enoate hydratase and 4-oxalocrotonate de- 
carboxylase are required for 4-oxalocrotonate decarboxyl- 
ase activity. 

Harayama et al. (20) postulate that the physical associa- 
tion of 2-oxopent-4-enoate hydratase and 4-oxalocrotonate 
decarboxylase might ensure efficient transformation of the 
unstable 2-oxopent-4-enoate hydratase substrate (Fig. 1, 
compound VI). They also point out that *his intermediate 
can be produced and efficiently metabolized by the hydro- 
lytic branch of the mira-cleavage pathway (Fig. i, step 3) in 
the absence of any apparent physical association of the 
enzymes involved. The common ancestry and structural 
similarity of 2-oxopent-4~enoate hydratase and 4-oxalocro- 
tonate decarboxylase might explain why a mechanism in- 
volving physical association of enzymes evolved for only the 
4-oxalocrotonate branch of the pathway. 
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ADDENDUM 

During the processing of the manuscript, nucleotide se- 
quences of four vy/ genes of the TOL-encoded meta<\eav- 
age pathway were published: jry/7\ which is analogous to 
dmpQ (18a), and xylGFJ* which is analogous to dmpCDE 
(22a). Furthermore, the ORF upstream of the toluene diox- 
ygenase genes of P. putida Fl, todF, was characterized 
(28a), and the deduced amino acid sequence was aligned 
with that of dmpD. 
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